Using Synonyms for Author Recognition
ثبت نشده
چکیده
An approach for identifying authors using synonym sets is presented. Drawing on modern psycholinguistic research, we justify the basis of our theory. Having formally defined the operations needed to algorithmically determine authorship, we present the results of applying our method to a corpus of classic literature. We argue that this technique of author recognition is both accurate as an author identification tool, as well as applicable to other domains in computer science such as speaker recognition.
منابع مشابه
Developing a Standardized Medical Speech Recognition Database for Reconstructive Hand Surgery
Fast and holistic access to the patients’ clinical record is a major requirement of modern medical decision support systems (DSS). While electronic health records (EHRs) have replaced the traditional paper-based records in most healthcare organization, the data entry into these systems remains largely manual. Speech recognition technology promises substitution of the more convenient speech-base...
متن کاملModel-Portability Experiments for Textual Temporal Analysis
We explore a semi-supervised approach for improving the portability of time expression recognition to non-newswire domains: we generate additional training examples by substituting temporal expression words with potential synonyms. We explore using synonyms both from WordNet and from the Latent Words Language Model (LWLM), which predicts synonyms in context using an unsupervised approach. We ev...
متن کاملA Classifier System for Author Recognition Using Synonym-Based Features
The writing style of an author is a phenomenon that computer scientists and stylometrists have modeled in the past with some success. However, due to the complexity and variability of writing styles, simple models often break down when faced with real world data. Thus, current trends in stylometry often employ hundreds of features in building classifier systems. In this paper, we present a nove...
متن کاملUsing the semantic web for author disambiguation - are we there yet?
The quality, and therefore, the usability and reliability of data in digital libraries depends on author disambiguation, i.e., the correct assignment of publications to a particular person. Author disambiguation aims to resolve name ambiguity, i.e., synonyms (the same author publishing under different names), and polysemes (different authors with the same name), and assign publications to the c...
متن کاملAutomatic Extraction from Scientific Abstracts of Synonyms for Proteins and Genes
Introduction: Protein and gene names change frequently as research reveals details about these entities. 1 Because authors often use synonyms, information retrieval requires identification of these alternate names. Many biological databases — such as GenBank 2 and SWISSPROT 3 — have synonym databases; however, the databases may not be complete. Furthermore, to our knowledge, the extraction of s...
متن کامل